Approximating Phonotactic Input in Children's Linguistic Environments from Orthographic Transcripts
نویسندگان
چکیده
Child-directed spoken data is the ideal source of support for claims about children’s linguistic environments. However, phonological transcriptions of child-directed speech are scarce, compared to sources like adult-directed speech or text data. Acquiring reliable descriptions of children’s phonological environments from more readily accessible sources would mean considerable savings of time and money. The first step towards this goal is to quantify the reliability of descriptions derived from such secondary sources. We investigate how phonological distributions vary across different modalities (spoken vs. written), and across the age of the intended audience (children vs. adults). Using a previously unseen collection of Swedish adultand child-directed spoken and written data, we combine lexicon look-up and graphemeto-phoneme conversion to approximate phonological characteristics. The analysis shows distributional differences across datasets both for single phonemes and for longer phoneme sequences. Some of these are predictably attributed to lexical and contextual characteristics of text vs. speech. The generated phonological transcriptions are remarkably reliable. The differences in phonological distributions between child-directed speech and secondary sources highlight a need for compensatory measures when relying on written data or on adult-directed spoken data, and/or for continued collection of actual child-directed speech in research on children’s language environments.
منابع مشابه
Phonotactic probabilities in young children's speech production.
This research explores the role of phonotactic probability in two-year-olds' production of coda consonants. Twenty-nine children were asked to repeat CVC non-words that were used as labels for pictures of imaginary animals. The CVC non-words were controlled for their phonotactic probabilities, neighbourhood densities, word-likelihood ratings, and contained the identical coda across low and high...
متن کاملPhonotactic probabilities at the onset of language development: speech production and word position.
PURPOSE To examine the role of phonotactic probabilities at the onset of language development, in a new language (Dutch), while controlling for word position. METHOD Using a nonword imitation task, 64 Dutch-learning children (age 2;2-2;8 [years;months]) were tested on how they imitated segments in low- and high-phonotactic probability environments, in word-initial and word-final position. The...
متن کاملThe Qur'an Lexicon Project: A database of lexical statistics and phonotactic probabilities for 19, 286 contextually and phonetically transcribed types in Qur'anic Arabic
Reciting and memorizing the Qur’an forms a major part of religious practice for 1.6 billion Muslims around the world; in non-Arabic-speaking Muslim communities, it also provides Muslim speakers of other languages with their first exposure to the Arabic script and language. However, little research has been completed regarding the psycholinguistic processing of Qur’anic Arabic. In this paper, we...
متن کاملImplicit learning of phonotactic constraints: Transfer from perception to production
This study asked whether new linguistic patterns acquired through recent perception experience can transfer to speech production. Participants heard and spoke sequences of syllables featuring novel phonotactic constraints (e.g. /f/ is always a syllable onset, /s/ is always a syllable coda). Participants’ speech errors reflected weaker learning of the constraints present in the spoken sequences ...
متن کاملThe sensitivity of children with SLI to phonotactic probabilities during lexical access.
UNLABELLED The procedural deficit hypothesis (Ullman & Pierpont, 2005) has been proposed to account for the combination of linguistic and nonlinguistic deficits observed in specific language impairment (SLI). According to this proposal, SLI results from a deficit in procedural memory that prevents children from developing sensitivity to probabilistic sequences, amongst other deficits. We tested...
متن کامل